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FIELD AND BACKGROUND OF THE INVENTION 

The present invention relates to isolated polynucleotides encoding 
polypeptides having invertase activity, constructs including same and methods 
of utilizing same. More particularly, the present invention relates to isolated 

25 polynucleotides encoding novel invertases, which polynucleotides can be used 
for substantially increasing the sugar content in for example, fruits, roots, 
leaves, etc., of plants expressing same. In addition, the present invention 
relates to a novel regulatory sequence which when integrated, in a site specific 
manner, into a solanaceae plant genome, can substantially increase the sugar 

30 content in tissues, such as for example, fruits, etc., of the solanaceae plant. 

Ever since the emergence of modern agriculture, agricultured plants 
have been manipulated in an effort to establish crops with agronomically 
important traits. 

Such traits typically include, plant yield and quality, enhanced growth 
35 rates and adaptation to various growth conditions. 
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At present, a great deal of emphasis is placed on the generation of 
plants having desired traits via genetic engineering techniques. However, 
since these traits are often the result of the activity of several genes (referred to 
as quantitative traits), the use of direct gene transfer in manipulating these 
5 traits, is difficult due to problems in pinpointing and then cloning the 
individual loci which contribute predominantly to the expression of the trait. 

As such, genetic manipulation of plants is typically practiced using 
conventional breeding techniques, such as hybrid crossing. 

Although utilizing such breeding techniques typically enables breeders 
io to interogress quantitative trait loci of a specific function into a desirable 
genetic background, such conventional breeding techniques suffer from 
several limitations. 

Oftentimes the "isolation" of a single trait loci (referred to as 
quantitative trait loci or QTL) can be difficult due to linkage drag, or due to 
1 5 effects of epistatic QTLs present in the genetic background or in chromosomal 
association with the single trait loci. 

One example of a plant which has been extensively bred is tomato. 
A major objective in tomato breeding is to increase the content of total 
soluble solids (TSS or brix; mainly sugars and acids) of the fruits in order to 
20 improve taste and processing qualities. 

As such, efforts have been made to interogress the high fruit sugar 
content of wild Lycopersicon species which is three times higher than 
cultivated varieties into cultivated varieties which are characterized by a large 
fruit mass, small foliage, concentrated ripening and other commercially 
desirable trails. 

To try and resolve the genetic basis for the high sugar content of fruits 
of wild Lycopersicon species, Eshed and Zamir (Genetics 141: 1147-1162, 
1995; Genetics 143: 1807-1817, 1996, both arc incorporated herein by 
reference) developed a set of 50 introgression lines from a cross between the 
so green-fruited species L. pennellii and the cultivated tomato, L. escukntum. 



25 
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Each of the lines contained a single RFLP defined L pennellii ^ 
chromosome segment, and together the lines provide complete coverage of the 
tomato genome. Using this resource it was possible to map 23 QTLs that 
regulate brix. 

5 Although this research work presents significant progress in 

determining the QTLs responsible for a high brix value, plants generated by 
introgressing L pennellii into the cultivated tomato, L. esculenlum genetic 
background are of little commercial value since their phenotype, in many 
aspects, is closer to that of the wMLycopersicon species. 

10 Tn order to generate hybrids characterized by a uniform ripening, a good 

cover of the fruit and a high brix value, which hybrids are of high commercial 
value, it is necessary to narrow the introgression described by Eshed and Zamir 
in order to isolate the brix QTL. 

Furthermore, since genetic crossing is genus limited, in order to enable 

15 generation of a high brix value in plants unbreedable with tomato plants, a 
gene or genes responsible for the high brix value in L pennellii must be 
isolated, which gene or genes when introduced and expressed in a plant other 
than tomato substantially increase the fruit brix value thereof. 

Thus, the present invention describes the isolation of polynucleotides 

20 which encode for novel plant invertases which are associated with the high 
brix value in L. pennellii fruit. The present invention further describes 
recombinant methods which utilize these isolated polynucleotides for 
increasing the brix value of plant tissues. 

25 SUMMARY OF THE INVENTION 

According to one aspect of the present invention there is provided an 
isolated nucleic acid comprising a genomic, complementary or composite 
polynucleotide sequence encoding a polypeptide having an invertase activity in 
an apoplastic environment and an N terminal amino acid sequence serving for 

30 secretion into an apoplast. 
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According to further features in preferred embodiments of the invention 
described below, the polypeptide is at least 80 % homologous to SEQ ID 
NO:5, as determined using the BestFit software of the Wisconsin sequence 
analysis package, utilizing the Smith and Waterman algorithm, where gap 

5 creation penalty equals 8 and gap extension penalty equals 2. 

According to Still further features in the described preferred 
embodiments the polynucleotide is at least 80 % identical with SEQ ID NO:6 
as determined using the BestFit software of the Wisconsin sequence analysis 
package, utilizing the Smith and Waterman algorithm, where gap weight 

,o equals 50, length weight equals 3, average match equals 10 and average 

mismatch equals -9. 

According to another aspect of the present invention there is provided 
an isolated nucleic acid comprising a genomic, complementary or composite 
polynucleotide sequence encoding a polypeptide having an invertase activity, 
l5 the polypeptide is at least 80 % homologous to SEQ ID NO:5, as determined 
using the BestFit software of the Wisconsin sequence analysis package, 
utilizing the Smith and Waterman algorithm, where gap creation penalty 
equals 8 and gap extension penalty equals 2. 

According to further features in preferred embodiments of the invention 
2 o described below, the polynucleotide is hybridizable with SEQ ID NOsrl, 4 or 6 
under hybridization conditions of hybridization solution containing 10 % 
dcxtranc sulfate, 1 M. NaCI, 1 % SDS and 5 x 10° cpm 32 p labeled probe, at 
65 °C, with a final wash solution of 0.1 x SSC and 0.1 % SDS and final wash 
at 60 °C. 

25 According to still further features in the described preferred 

embodiments the polynucleotide is at least 80 % identical with SEQ ID NO:6 
as determined using the BestFit software of the Wisconsin sequence analysis 
package, utilizing the Smith and Waterman algorithm, where gap weight 
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equals 50, length weight equals 3, average match equals 10 .aid average 



According to stil! further features in the described preferred 
embodiments the polypeptide is as set forth in SEQ 10 NO:5 or portions 



According to still further features in the described preferred 
embodiments the polynucleotide is as set forth in SEQ ID NO:6 or portions 



According to yet another aspect of the present invention there is 
10 provided a nucleic acid construct comprising any of the isolated nucleic acids 
described hereinabove. 

According to still further features in the described preferred 
embodiments the nucleic acid construct further comprising a promoter for 
regulating expression of the isolated nucleic acid in an orientation selected 
i s from the group consisting of sense and anlisense orientation. 

According to still further features in the described preferred 
embodiments the nucleic acid construct further comprising a positive and a 
negative selection markers for selecting for homologous recombination events. 
According to still another aspect of the present invention there is 
20 provided a plant cell, tissue or a whole plant comprising any of the nucleic acid 
constructs described herein. 

According to an additional aspect of the present invention there is 
provided a recombinant protein comprising a polypeptide having an invertase 
activity in an apoplastic environment and an N terminal amino acid sequence 
25 serving for secretion into an apoplast. 

According to still further features in the described preferred 
embodiments the polypeptide is at least 80 % homologous to SEQ ID NO:5, as 
determined using the BestFit software of the Wisconsin sequence analysis 
package, utilizing the Smith and Waterman algorithm, where gap creation 
so penalty equals 8 and gap extension penalty equals 2. 



mismatch equals -9. 



5 thereof. 



thereof. 
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According lo still further features in the described preferred 
embodiments the polypeptide includes at least a portion of SEQ TD NO:5. 

According to still ftirthcr features in the described preferred 
embodiments the protein is encoded by a polynucleotide hybridizable with 

5 SEQ TD NOs:l, 4 or 6 or a portion thereof under hybridization conditions of 
hybridization solution containing 10 % dextrane sulfate, 1 M NaCl, 1 % SDS 
and 5 x 10 6 cpm 32 p labeled probe, at 65 °C, with a final wash solution of 0.1 
x SSC and 0.1 % SDS and final wash at 60 °C. 

According to still further features in the described preferred 

10 embodiments the protein is encoded by a polynucleotide at least 80 % identical 
with SEQ ID NO:6 or portions thereof as determined using the BestFit 
software of the Wisconsin sequence analysis package, utilizing the Smith and 
Waterman algorithm, where gap weight equals 50, length weight equals 3, 
average match equals 10 and average mismatch equals -9. 

15 According to still further features in the described preferred 

embodiments the recombinant protein comprising a polypeptide as set forth in 
SEQ ID NO:5. 

According to yet an additional aspect of the present invention there is 
provided a method of increasing a level of a monosaccharide in a plant tissue, 

20 the method comprising the step of expressing in the plant tissue a polypeptide 
having invcrtase activity, wherein the polypeptide is at least 80 % homologous 
to SEQ TD NO:5 as determined using the BestFit software of the Wisconsin 
sequence analysis package, utilizing the Smith and Waterman algorithm, 
where gap creation penalty equals 8 and gap extension penalty equals 2. 

25 According to still an additional aspect of the present invention there is 

provided a method of increasing a level of a monosaccharide in a plant tissue, 
the method comprising the step of expressing a polypeptide having invcrtase 
activity, wherein the polypeptide is encoded by a polynucleotide hybridizable 
with SEQ ID NOs:l, 4 or 6 or a portion thereof under hybridization conditions 
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or hybridization solution containing 10 % dextrane sulfate, 1 M NaCl, 1 % 
SDS and 5 x 10 6 cpm 32 p labeled probe, at 65 °C, with a final wash solution 
of 0.1 x SSC and 0.1 % SDS and final wash at 60 °C. 

According to another aspect of the present invention there is provided a 

5 method of increasing a level of a monosaccharide in a plant tissue, the method 
comprising the step of expressing a polypeptide having invertase activity, 
wherein the polypeptide is encoded by a polynucleotide at least 80 % identical 
with SEQ ID NO:6 as determined using the BestFit software of the Wisconsin 
sequence analysis package, utilizing the Smith and Waterman algorithm, 

10 where gap weight equals 50, length weight equals 3, average match equals 10 
and average mismatch equals -9. 

The present invention successfully addresses the shortcomings of the 
presently known configurations by providing means with which the 
monosaccharide content of plant tissue or organ can be increased. 



BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is herein described, by way of example only, with 
reference to the accompanying drawings. With specific reference now to the 
drawings in detail, it is stressed that the particulars shown are by way of 

20 example and for purposes of illustrative discussion of the preferred 
embodiments of the present invention only, and are presented in the cause of 
providing what is believed to be the most useful and readily understood 
description of the principles and conceptual aspects of the invention. In this 
regard, no attempt is made to show structural details of the invention in more 

25 detail than is necessary for a fundamental understanding of the invention, the 
description taken with the drawings making apparent to those skilled in the art 
how the several forms of the invention may be embodied in practice. 
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In the drawings: 

FIGs. la-i depict the chromosomal locations, sizes and identities of the 
50 L penneltii introgression lines (ILs) on chromosomes 1-12. The genetic 
map was constructed on the basis of 1 19 BC1 plants as described by Eshed and 

5 Zamir (1995). Mapped markers which are associated with the chromosome of 
a plant line, and markers not assayed on the BC1 map are placed according to 
their approximate positions based on Tanksley ct al. Each line was probed 
with all the markers, and lines showing wild-species alleles are marked with 
bars to the left of the chromosome, c - L. esculentum, p - L. pennellii (Prior 

io art). 

FIG. 2 depicts the digenic interactions between unlinked QTLs. The 
values on the left and at the top of the Figures are the difference (in %) of each 
IL hybrid (ILI-I) from M82 according to Table 2 below. Values in bold are 
significant at p<0.05 (Dunnet's t test). Each histogram represents the 

15 difference (in %) of the hybrid heterozygous for the two introgression from the 
sum of the effect of the two individual lLHs for all traits measured (PW- plant 
weight, FM- fruit mass, B- brix, Y- total fruit yield, BY- the product of B and 
Y). Bars in white show no significant interaction and bars in light gray, gray 
and black indicate significant interactions of p<0.05, pO.Ol and p<0.001, 

20 respectively (prior art). 

FIG. 3 depicts the distribution of the observed and expected numbers of 
pairs of introgressions showing simultaneous significant epistasis (p<0.05) for 
the traits: plant weight (PW), fruit mass (FM), brix (B) and yield (Y). The 
expected values were calculated on the basis of complete independence 

25 between traits and a mean epistatic rate of 0.28 for each trait (prior art). 

FIG. 4 depicts the relationship between the expected an observed values 
for plant weight, fruit mass, brix, yield, and brix x yield of 45 hybrids of two 
ILs. Expected values were calculated on the basis of complete additivity of the 
effects of the individual ILHs. (prior art). 
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FIG. 5 depicts the fine mapping of linked QTLs for B and FM on the 
long ami of chromosome 2. The dark bars represent the L. pennellii 
chromosome segments tntrogressed into M82. Each point is the mean of the 
estimated introgression effect; bars represent the standard errors of the means. 

5 The mean phenotypic value of each line was determined as described in 
example 2 of the Examples section that follows (prior art). 

FIG. 6 is a photograph depicting the fruit size of lines used for the 
mapping analysis of the linked QTL on chromosome 2. Top, IL2-5. Second 
row: left, IL2-5-1; right, 1L2-6-L Third row: left, 1L2-5-3; center, TU-6-6; 

io right, IL2-6-4. Bottom, M82 (prior art). 

FIG. 7 depicts an interaction between 1L9-2-5 and the year grown as 
expressed by plant weight (PW), fruit mass (FM) and brix (B). The values of 
1L9-2-5 and the hybrid ILH9-2-5 over the years 1995, 1996 and 1997 are 
expressed as the percent difference from the isogenic control. M82 (A% of 

15 M82). Results for IL9-2-5 are indicated by the gray bars while results for 
TLH9-2-5 are indicated by the ladder bars. * above the bars denotes a 
significance difference (p<0.01) from the control and * in the d values 
represents a significant (p<0.05) dominance deviation of the heterozygous. For 
the traits showing no yearly dependence (alpha level = 0.01) data from the 

20 three years was pooled to estimate a and d. The mean and the standard 
deviation values for M82 are indicated at the bottom of the Figure; PW- Kg, 
FM-g; B-%. 

FIG. 8a depicts a genetic map of the 1L9-2-5 introgression. The genetic 
distance in centimorgans (cM) is indicated between each pair of markers and is 
25 based on the F2 population. The genotype of 11,9-2-5, IL9-2-6 and 1L9-2-7 is 
represented by a hatched bar (L. pennellii) and an empty bar (L. esculentum). 
The border between the two bars is determined arbitrary between the two 
flanking markers. 



13/03 02 WED 15:14 FAX 972 3 6127575 G. E. EHRLICH IOD7D953 m QlOWi 



10 

HG. 8b depicts the phcnotypk effects of the IL9-2-5 (ladder bar), IL9- 
2-6 (black) and IL9-2-7 (white) hybrids compared to the control M82. * and + 
above the bars denote a significance difference (p<0.05, p<0.1, respectively) 
from the control. Each value represents the mean of eight plots. 

5 FIG. 9 is a scalier plot depicting brix values of two isogenic hybrids, 

M82 x line 202 (17 plants) and IL9-2-5 x line 202 (8 plants). The center lines 
of the means (diamonds) arc the group means, The top and bottom of the 
diamonds form the 95 % confidence intervals for the means. sp9 - a novel 
tomato marker (SEQ ID NO:9 as a probe and EcoKV as a restriction enzyme). 

io FIG. 10 is a schematic depiction of the 11,9-2-5 and IL-9-2-4 

introgrcssions with respect to tomato chromosome 9. 

FIG. 1 1 is a collection of scatter plots depicting the effects of brix 9-2-5 
in the 3-year trial of the indeterminate (glasshouse) NILs. c - L, esculentum, p 
- L pennellii The homozygous IL (pp), containing segment of chromosome 9 f 

15 improved B by 27 percent over the control (ere) with partial dominance for the 
wild species segment (ep) (a=0.5, d=0.25, d/a=0.5). Black arrows and 
horizontal gray lines mark the mean values and the 99.9 % confidence interval 
for each genotype. 

FIG. 12 is a collection of scatter plots depicting the effects of Brix9-2-5 

20 in the 3-year trial of the determinate NILs. ee, pp and ep represent NILs 
homozygous for the L esculentum allele Brix9-2-5 r NILs homozygous for the 
L pennellii allele, and heterozygous NILs, respectively. Black arrows and 
horizontal gray lines mark the mean values and the 99.9 % confidence interval 
lor each genotype. 

25 FIG. 13 depicts the fine-mapping and physical positioning of Brix9-2-5 

on chromosome 9. The upper portion shows the genetic linkage map (in cM) 
of the chromosomal region of Brix9-2-5 y wherein the two end clones of 
BAC91A4, 91N and 9 IS, are indicated in boxes. The mid portion shows the 
genetically ordered markers on BAC91A4 and the number of recombinants 
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between them. The lower portion shows the recombination groups in the 
BAG. Each group in the lower portion is composed of families with a 
common introgressed segment and is represented by a divided bar of hatched 
{L. pennellii) and empty (L esculentum) genomic segments. The borders 

5 between bars are arbitrarily drawn midway between markers positive and 
negative for the introgressed L. pennellii segment. 

FIG. 14a depicts the nucleotide polymorphism (NP) and phenotypic 
analysis of 13 recombinant families of Brix9-2-5. Each NP is represented by a 
nucleotide number and the corresponding L pennellii (top) and L 

io esculentum nucleotides. Full black bars denote a significant phenotypic effect 
(p<0.001). * denotes a verification of recombinant family 6 (- 3 %, nor 
significant) in an F4 generation. 

FIG. 14b depicts the genomic structure of the Lin5 gene. Boxes depict 
exons and the arrows represent the recombination points for each of the 

15 individual recombinant families (numbered as in Figure 14a), the nucleotide 
sequence of the L pennellii Lin5 region spanning the QTL is presented by 
nucleotides 2301-2850, which are numbered to correspond to the start codem 
of Un5. NPs between the two species arc shown in bold and the codons for 
the 3 amino acids substitutions are underlined: positions 2403 (Asp in L 

20 pennellii to Glu in L esculentum), 2457 (Asp to Asn) and 2478 (Val to Leu); 
the intron sequence is depicted by outlined letters. Deleted nucleotides in the 
L, esculentum sequence are boxed and a four bp insertion (ATCT) following 
base 2735 is indicated by a v . The 1 8-bp direct repeat is double underlined and 
the 7 -bp repeats are marked with a wavy line. The start and stop codons of a 

25 hypothetical intraintronal open reading frame are denoted by dashed boxes. 
FIG. 15 depicts the Lin5 exon-intron structure. 

DFSCRIPTION OF THE PREFERRED EMBOD1MHNTS 

The present invention is of isolated polynucleotides encoding novel 
30 plant invertases which can be used to increase the monosaccharide content in 
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plants transformed therewith. Specifically, the present invention can be used 
to increase the monosaccharide content in plant tissues, such as, for example, 
fruits, leaves or roots by expressing at least one of the isolated polynucleotides 
which encode said novel plant invertases within the plant. The present 
5 invention is further of a novel plant expression regulatory element which can 
be used to increase the monosaccharide content in fruits of plants into which 
this regulatory element is genomically integrated in a site specific manner, 
especially in solanaceae. 

The principles and operation of the present invention may be better 
io understood with reference to the drawings and accompanying descriptions. 

Before explaining at least one embodiment of the invention in detail, it 
is to be understood that the invention is not limited in its application to the 
details of construction and the arrangement of the components set forth in the 
following description or illustrated in the drawings. The invention is capable 
15 of other embodiments or of being practiced or carried out in various ways. 
Also, it is to be understood that the phraseology and terminology employed 
herein is for the purpose of description and should not be regarded as limiting. 

Edible plant tissues, such as fruit, which store a high level of 
monosaccharides are a particularly sought after for both commercial 
20 processing and personal consumption. 

As such, plant breeding techniques are often used by plant breeders in 
order to transfer such a desirable trait into cultivated species. 

However, plant breeding techniques can only be used for related plant 
species and as such this desired trait cannot be transferred between unrelated 
25 plant species. 

As such, the isolation of a gene or genes which are responsible for this 
trait is necessary such that recombinant techniques can be used to introduce 
this trait into a wide range of plants. 

One family of genes which are responsible for monosaccharide 
30 generation in plants are the extracellular invertases. 
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Extracellular invertases enzymes arc hydrolases, cleaving sucrose to 
glucose and fructose, which arc transported into the cells. This activity 
maintains a gradient of assimilates, from the source parts of the plant, to the 
developing sink tissues. Cell wall invertases are synthesized as preproteins, 

5 with a long leader sequence which is cleaved off during transport and protein 
maturation. All known cDNA-derived amino acid sequences of invertases 
possess a signal peptide, required for entry into the endoplasmatic reticulum 
(ER) and, thus, into the secretory pathway. The mature peptide includes the 
NDPNG (SEQ TV) NO:7) and WECPDF (SEQ ID NO:8) sequences which 

10 form the (J-fructosidase motif and the catalytic site, respectively. 

In tomato, the apoplaslic invertasc isoenzymes are encoded by a gene 
family comprising four members: Lin 5, Lin 6, Lin 7 and Lin 8 (Godt and 
Roitch, Plant Physiol. 1 15, 273-282, 1997). The published sequences of this 
gene family are mostly of the third and biggest exon of the gene, exon 3, and 

15 the full sequence of each of these genes remains undetermined. 

As further detailed hcrcinbelow in Examples 3-5 of the Examples 
section below, while reducing the present invention to practice, a carefully 
planned approach using marker selected breeding of tomato plant introgrcssion 
lines (ILs) enabled the determination of a narrow chromosomal region which is 

20 associated with the high level of monosaccharide accumulation (brix value) in 
L pennellii fruits. Sequencing of a bacterial artificial chromosome (BAC) 
which includes this region has revealed the existence of two novel invertase 
genes which display identity to previously published LinS and Lin 7 partial 
cDNA sequences and which are termed herein as eLin5 and eLin7, 

25 respectively. 

Sequencing of a cosmid clone from a L. pennellii genomic library 
revealed the full genomic sequence of the wild species allele which is termed 
herein as pLinS. 
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Comparison to L, esculentum sequences of the same chromosomal 
region has also revealed the existence of a sequence unique to the L. pennelli 
chromosomal sequence. This sequence which is 484 base pairs long spans a 
portion of the genomic polynucleotide sequence of pLin5 which includes a 3 f 
5 portion of exons 3, intron 3 and a 5' portion of exon 4 of pTJnS. 

As further detailed in Example 5 of the Examples section, this sequence 
functions in regulating the expression of Lin5. As is further detailed in the 
Examples provided hereinbelow, the expression products of Lin5 transcript(s) 
function in contributing to the high brix value associated with L. pmnellii 
10 nearly isogenic line(NIL) fruits. 

Thus, according to one aspect of the present invention rhere is provided 
an isolated nucleic acid comprising a genomic, complementary or composite 
polynucleotide sequence encoding a polypeptide having an invertase activity in 
an apoplastic environment and an N terminal amino acid sequence serving for 
15 secretion into an apoplast. 

As used herein in the specification and in the claims section that 
follows, the phrase "complementary polynucleotide sequence 11 includes 
sequences which originally result from reverse transcription of messenger 
RNA using a reverse transcriptase or any other RNA dependent DNA 
20 polymerase. Such sequences can be subsequently amplified in vivo or in vitro 
using a DNA dependent DNA polymerase. 

As used herein in the specification and in the claims section that 
follows, die phrase "genomic polynucleotide sequence" includes sequences 
which originally derive from a chromosome and reflect a contiguous portion of 
25 a chromosome. 

As used herein in the specification and in the claims section that 
follows, the phrase "composite polynucleotide sequence" includes sequences 
which are at least partially complementary and at least partially genomic. 

The phrase "having an invertase activity in an apoplastic environment" 
30 is used herein to distinct cellular invertases from those secreted into the 
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apoplasl. Plant invertases are characterized by their subcellular localization, 
their pH optima and their characterizing isoelectric point, pi. Intracellular 
invertase arc characterized by acidic pH optima and low pi and are thought to 
be in the vacuole, whereas extracellular invertases are also characterized by 
5 acidic optima but a high pi thai enables its bounding to the negatively charged 
cell-wall. Comparison of the known plant invertase genes revealed at least 
two distinguishing motifs: (i) Cell-wall invertases carry the amino acid Proline 
in their (J-iiuctosidasc motif (WECPDF, SEQ ID NO: 8), as is compared to 
Valine in the vacuolar peptide; and (it) in contrast to the cell wall invertases, 

10 the intracellular invertases contain an additional C-terminal extension, which 
might be involved in the vacuolar targeting of the protein. 

According to one preferred embodiment of the present invention, the 
isolated nucleic acid encoding a polypeptide having an invertase activity is at 
least 80 %, at least 85 %, at least 90 %, at least 95 %, at least 98-100 % 

15 homologous to SEQ ID NO:5 as determined using the BestFit software of the 
Wisconsin sequence analysis package, utilizing the Smith and Waterman 
algorithm, where gap creation penalty equals 8 and gap extension penalty 
equals 2, 

As used herein the terms "homology" or "homologous" refer to the 
20 resemblance between compared polypeptide sequences as determined from the 

identity (match) and similarity (amino acids of the same group) between amino 

acids which comprise these polypeptide sequences. 

In addition, or alternatively this isolated nucleic acid is at least 80 %, at 

least 85 %, at least 90 %, at least 95 % identical with SEQ ID NO:6 as 
25 determined using the BestFit software of the Wisconsin sequence analysis 

package, utilizing the Smith and Waterman algorithm, where gap weight 

equals 50, length weight equals 3, average match equals 10 and average 

mismatch equals -9. 
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According to another preferred embodiment of the present invention, 
the isolated nucleic acid is hybridizable with SEQ ID NOs:l, 4 or 6 under 
moderate lo stringent hybridation conditions suitable for polynucleotides 
longer than 200 base pairs. 
5 Hybridization under moderate hybridization conditions is effected by a 

hybridization solution containing 10 % dextrane sulfate, J. M NaCl, 1 % SDS 
and 5 x 10 6 cpm 32 p labeled probe, at 65 °C 5 with a final wash solution of 1 x 
SSC and 0.1 % SDS and final wash at 50 or 55 °C whereas, hybridization 
under stringent hybridization conditions is effected by a hybridization solution 
10 containing 10 % dextrane sulfate, 1 M NaCl, 1 % SDS and 5 x 10<> cpm * 2 p 
labeled probe, at 65 °C P with a final wash solution of 0.1 x SSC and 0.1 % 
SDS and final wash at 60 or 65 6 C 

According to another preferred embodiment of the present invention, 
the isolated nucleic acid encodes a polypeptide whicli is as set forth in SEQ ID 
15 NO:5 Or portions thereof having the inyertase activity. 

This polypeptide sequence which is designated herein as eLin5 (SEQ 
ID NO:5) includes a secretion signal sequence and is expected to have high 
invertasc activity under apoplastic environment conditions. 

According to another preferred embodiment of the present invention the 
20 isolated nucleic acid is as set forth in SEQ ID NO:6 or portions thereof. 

According to another aspect of the present invention there is provided 
an isolated nucleic acid comprising a genomic, complementary or composite 
polynucleotide sequence encoding a polypeptide having an invertase activity, 
the polypeptide is at least 80 %, at least 85 % at least 90 % at least 95 % at 
25 least 98-100 % homologous to SEQ ID NO:5, as determined using the BestFit 
software of the Wisconsin sequence analysis package, utilizing the Smith and 
Waterman algorithm, where gap creation penalty equals 8 and gap extension 
penalty equals 2, 
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According to a preferred embodiment the isolated nucleic acid of this 
aspect of (he present invention is hybridizablc with SEQ ID NOs:l, 4 or 6 
tinder moderate to stringent hybridization conditions. 

According to another preferred embodiment, the isolated nucleic acid of 
5 this aspect of the present invention is at least 80 %.. at least 85 %, at least 90 
%, at least 95 %, at least 98-100 % identical with SHQ ID NO:6 as determined 
using the BestFit software of the Wisconsin sequence analysis package, 
utilizing the Smith and Waterman algorithm, where gap weight equals 50, 
length weight equals 3, average match equals 10 and average mismatch equals 



According to another preferred embodiment of the present invention the 
isolated nucleic acid of this aspect of the present invention is as set forth in 
SEQ ID NO:6 or portions thereof. 

According to another aspect of the present invention there is provided a 
15 nucleic acid construct including any of the isolated nucleic acid sequences 
mentioned hereinabove. 

The nucleic acid construct according to the present invention can be 
utilized to express the isolated nucleic acid within a plant, plant derived 
tissues, or plant cells either possessing a cell wall or not (protoplasts) 
20 Thus, according to a preferred embodiment of the present invention, the 

nucleic acid construct further includes a promoter for regulating expression of 
the isolated nucleic acid in a sense or antisense orientation. 

Numerous plant functional expression promoters and enhancers which 
can be either tissue specific, developmenlally specific, constitutive or induced 
25 and which can be utilized by the construct of the present invention, some 
examples are provided hereinunder. 

As used herein in the specification and in the claims section that follows 
the phrase "plant promoter" includes a promoter which can direct gene 
expression in plant cells (including DNA containing organelles). Such a 
30 promoter can be derived from a plant, bacterial, viral, fungal or animal origin. 



10 



-9. 
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Such a promoter can be constitutive, i.e., capable of directing high level of 
gene expression in a plurality of plant tissues, tissue specific, i.e., capable of 
directing gene expression in a particular plant tissue or tissues, inducible, i.e T , 
capable of directing gene expression under a stimulus, or chimeric, i.e., formed 

5 of portions of at least two di ffercnt promoters. 

Thus, the plant promoter employed can be a constitutive promoter, a 
tissue specific promoter, an inducible promoter or a chimeric promoter. 

Examples of constitutive plant, promoters include, without being limited 
to, CaMV35S and CaMV19S promoters, FMV34S promoter, sugarcane 

10 bacilliform badnavirus promoter, CsVMV promoter, Arabidopsis ACT2/ACT8 
actin promoter, Arabidopsis ubiquitin UBQl promoter, barley leaf thionin 
BTH6 promoter, and rice actin promoter. 

Examples of tissue specific promoters include, without being limited to, 
bean phaseolin storage protein promoter, DLHC promoter, PHS(5 promoter, 

15 zein storage protein promoter, conglutin gamma promoter from soybean. 
AT2S1 gene promoter, ACT 11. actin promoter from Arabidopsis, napA 
promoter from Brass ica napus and potato patatin gene promoter. 

The inducible promoter is a promoter induced by a specific stimuli such 
as stress conditions comprising, for example, light, temperature, chemicals, 

20 drought, high salinity, osmotic shock, oxidant conditions or in case of 
pathogenicity and include, without being limited to, the light-inducible 
promoter derived from the pea rbcS gene, the promoter from the alfalfa rbcS 
gene, the promoters DRE, MYC and MYB active in drought; the promoters 
INT, INPS, prxEa, I la hsp17.7G4 and RD21 active in high salinity and 

25 osmotic stress, and the promoters hsr203J and str246C active in pathogenic 
stress. 

The construct according to the present invention preferably further 
includes an appropriate selectable marker such as for example an antibiotic 
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resistance gene. In a more preferred embodiment according to the present 
invention the construct further includes an origin of replication. 

The construct according to the present invention can be a shuttle vector, 
which can propagate both in E. coli (wherein the construct comprises an 
5 appropriate selectable marker and origin of replication) and be compatible for 
propagation in cells, or integration in the genome, of a plant. The construct 
according lo this aspect of the present invention can be, for example, a 
plasmid, a bacmid, a phagemid, a cosmid, a phage, a virus or an artificial 
chromosome. 

10 There arc various methods of introducing nucleic acid constructs into 

both monocotyledonous and dicotyledcnous plants (Potrykus, L, Annu. Rev. 
Plant. Physiol., Plant. Mol. Biol. (1991) 42:205-225; Shimamoto et al, 
Nature (1989) 338:274-276). Such methods rely on either stable integration of 
the nucleic acid construct or a portion thereof into the genome of the plant, or 

15 on transient expression of the nucleic acid construct in which case these 
seq uences are not inherited by a progeny of the plant 

There are two principle methods of effecting stable genomic integration 
of exogenous nucleic acid sequences such as those included within the nucleic 
acid construct of the present invention into plant genomes: 

20 (i) Agrobacierium-mediatQd gene transfer: Klee et al (1987) Annu. 

Rev. Plant Physiol. 38:467-486; Klce and Rogers in Cell Culture and 
Somatic Cell Genetics of Plants, Vol. 6, Molecular Biology of Plant Nuclear 
Genes, eds. Schell, J., and Vasil, L. K., Academic Publishers, San Diego, 
Calif (1989) p. 2-25; Gatenby, in Plant Biotechnology, eds. Kung, S. and 

25 Arntzen,C. J., Butterworth Publishers, Boston, Mass. (1989) p. 93-112. 

(ii) direct DNA uptake: Paszkowski el al, in Cell Culture and Somatic 
Cell Genetics of Plants, Vol. 6, Molecular Biology of Plant Nuclear Genes 
eds. Schell, J., and Vasil, r,. K., Academic Publishers, San Diego, Calif. 
(1989) p. 52-68; including methods for direct uptake of DNA into protoplasts, 

30 Toriyama, K. et al (1988) Bio/Technology 6:1072-1074. DNA uptake 
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induced by brief electric shock of plant cells: Zhang et al Plant Cell Rep. 
(1988) 7:379-384. Fromm et al Nature (1986) 319:79U793. DNA injection 
into plant cells or tissues by particle bombardment, Klein et ai 
Bio/Technology (1988) 6:559-563; McCabc et al Bio/Technology (1988) 
5 6:923-926; Sanford, Physiol. Plant. (1990) 79:206-209; by the use of 
micropipette systems: Neuhaus et al, Theor. Appl. Genet. (1987) 75:30-36; 
Neuhaus and Spangenberg, Physiol. Plant. (1990) 79:21.3-217; or by the direct 
incubation of DNA with germinating pollen, DeWct et al in Experimental 
Manipulation of Ovule Tissue, eds. Chapman, G. P. and Mantell, S. H. and 

to Daniels, W. Longman, London, (1985) p. 197-209; and Ohta, Proc. Natl. 
Acad. Sci. USA (1986) 83:715-719. 

The Agrobacterium system includes the use of plasmid vectors that 
contain defined DNA segments that integrate into the plant genomic DNA. 
Methods of inoculation of the plant tissue vary depending upon the plant 

15 species and the Agrobacterium delivery system. A widely used approach is the 
leaf disc procedure which can be performed with any tissue cxplant that 
provides a good source for initiation of whole plant differentiation. Horsch et 
al in Plant Molecular Biology Manual A5, Kluwer Academic Publishers, 
Dordrecht (1988) p. 1-9, A supplementary approach employs the 

20 Agrobacterium delivery system in combination with vacuum infiltration. The 
Agrobacterium system is especially viable in the creation of transgenic 
dtcotyledenous plants. 

There are various methods of direct DNA transfer into plant cells. In 
electroporaiion, protoplasts are briefly exposed to a strong electric field. In 

25 microinjection, the DNA is mechanically injected directly into the cells using 
very small micropipettcs. In microparticle bombardment, the DNA is adsorbed 
on microprojectiles such as magnesium sulfate crystals, tungsten particles or 
gold particles, and the microprojectiles are physically accelerated into cells or 
plant tissues. 
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Following transformation plant propagation is exercised. The most 
common method of plant propagation is by seed. Regeneration by seed 
propagation, however, has the deficiency that due to heterozygosity there is a 
lack of uniformity in the crop, since seeds arc produced by plants according to 

5 the genetic variances governed by Mendelian rules. Basically, each seed is 
genetically different and each will grow with its own specific traits. Therefore, 
ii is preferred that the transformed plant be produced such that the regenerated 
plant has the identical traits and characteristics of the parent transgenic plant. 
Therefore, it is preferred that the transformed plant be regenerated by 

10 micropropagation which provides a rapid, consistent reproduction of the 

transformed plants. 

Transient expression methods which can be utilized for transiently 
expressing the isolated nucleic acid included within the nucleic acid construct 
of the present invention include, but are not limited to, microinjection and 

15 bombardment as described above but under conditions which favor transient 
expression, and viral mediated expression wherein a packaged or unpackaged 
recombinant virus vector including the nucleic acid construct is utilized to 
infect plant tissues or cells such that a propargating recombinant virus 
established therein expresses the non-viral nucleic acid sequence. 

20 Viruses that have been shown to be useful for the transformation of 

plant hosts include CaV, TMV and BV. Transformation of plants using plant 
viaises is described in U.S. Pat. No. 4,855,237 (BGV), EP-A 67,553 (TMV), 
Japanese Published Application No. 63-14693 (TMV), EPA 194,809 (BV), 
EPA 278,667 (BV); and Gluzman, Y. et al., Communications in Molecular 

25 Biology: Viral Vectors, Cold Spring Harbor Laboratory, New York, pp. 172- 
189 (1988). Pseudovirus particles for use in expressing foreign DNA in many 
hosts, including plants, is described in WO 87/06261. 

Construction of plant RNA viruses for the introduction and expression 
of non-viral exogenous nucleic acid sequences in plants is demonstrated by the 

30 above references as well as by Dawson, W. 0. et al., Virology (1989) 
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172:285-292; Takamatsu et al. EMBO 1. (1987) 6:307-311; French .i al 
Science (1986) 231:1294-1297; and Takamatsu et al. FEBS Letters (1990) 
269:73-76. 

When the virus is a DNA virus, the constructions can be made to the 
virus itself. Alternatively, the virus can first be cloned into a bacterial plasmid 
for ease of constructing the desired viral vector with the foreign DNA. The 
virus can then be excised from the plasmid. If the virus is a DNA virus, a 
bacterial origin of replication can be attached to the viral DNA, which is then 
replicated by the bacteria. Transcription and translation of this DNA will 
produce the coal protein which will encapsidate the viral DNA. If the virus is 
an RNA virus, the virus is generally cloned as a cDNA and inserted into a 
plasmid. The plasmid is then used to make all of the constructions. The RNA 
virus is then produced by transcribing the viral sequence of the plasmid and 
translation of the viral genes to produce the coat protein(s) which encapsidate 

15 the viral KNA. 

Construction of plant RNA viruses for the introduction and expression 
in plants of non-viral exogenous nucleic acid sequences such as those included 
in the construct of the present invention is demonstrated by the above 
references as well as in U S. Pat. No. 5,316,931. 

Tn one embodiment, a plant viral nucleic acid is provided in which the 
native coat protein coding sequence has been deleted from a viral nucleic acid, 
a non-native plant viral coat protein coding sequence and a non-native 
promoter, preferably the subgenomic promoter of the non-native coat protein 
coding sequence, capable of expression in the plant host, packaging of the 
recombinant plant viral nucleic acid, and ensuring a systemic infection of the 
host by the recombinant plant viral nucleic acid, has been inserted. 
Alternatively, the coat protein gene may be inactivated by insertion of the non- 
native nucleic acid sequence within it, such that a protein is produced. The 
recombinant plant viral nucleic acid may contain one or more additional non- 
30 native subgenomic promoters. Each non-native subgenomic promoter is 



20 



25 
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capable of transcribing or expressing adjacent genes or nucleic acid sequences 
in the plant host and incapable of recombination with each other and with 
native subgenomic promoters. Non-native (foreign) nucleic acid sequences 
may be inserted adjacent the native plant viral subgenomic promoter or the 
5 native and a non-native plant viral subgenomic promoters if more than one 
nucleic acid sequence is included. The non-native nucleic acid sequences arc 
transcribed or expressed in the host plant under control of the subgenomic 
promoter to produce the desired products. 

In a second embodiment, a recombinant plant viral nucleic acid is 
10 provided as in the first embodiment except that the native coat protein coding 
sequence is placed adjacent one of the non-native coat protein subgenomic 
promoters instead of a non-native coat protein coding sequence. 

In a third embodiment, a recombinant plant viral nucleic acid is 
provided in which the native coat protein gene is adjacent its subgenomic 
15 promoter and one or more non-native subgenomic promoters have been 
inserted into the viral nucleic acid. The inserted non-native subgenomic 
promoters are capable of transcribing or expressing adjacent genes in a plant 
host and are incapable of recombination with each other and with native 
subgenomic promoters. Non-native nucleic acid sequences may be inserted 
20 adjacent the non-native subgenomic plant viral promoters such that said 
sequences are transcribed or expressed in the host plant under control of the 
subgenomic promoters to produce the desired product. 

In a fourth embodiment, a recombinant plant viral nucleic acid is 
provided as in the third embodiment except that the native coat protein coding 
25 sequence is replaced by a non-native coat protein coding sequence. 

The viral vectors arc encapsidated by the coat proteins encoded by the 
recombinant plant viral nucleic acid to produce a recombinant plant virus. The 
recombinant plant viral nucleic acid or recombinant plant virus is used to 
infect appropriate host plants, The recombinant plant viral nucleic acid is 
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capable of replication in the host, systemic spread in the host, and transcription 
or expression of foreign gene(s) in the host to produce the desired protein. 

Thus, according to a preferred embodiment of the present invention the 
polynucleotide or nucleic acid molecule of the present invention farther 
5 includes one or more sequence elements, such as, but not limited to, a nucleic 
acid sequence encoding a transit peptide, an origin of replication for 
propagation in bacterial cells, at least one sequence element for integration into 
a plant's genome, a polyadenylation recognition sequence, a transcription 
termination signal, a sequence encoding a translation start site, a sequence 
io encoding a translation, stop site, plant RMA virus derived sequences, plant 
DNA virus derived sequences, tumor inducing (Tt) plasmid derived sequences 
and a iransposablc clement derived sequence. 

According to another aspect of the present invention there is provided a 
method of increasing a level of a monosaccharide in a plant tissue, the method 
15 comprising the step of expressing in the plant tissue a polypeptide having 
invertase activity, wherein the polypeptide is at least 80 %, at least 85 %, at 
least 90 % at least 95 %, at least 98-100 % homologous to SEQ ID NO:5 as 
determined using the BestFit software of the Wisconsin sequence analysis 
package, utilizing the Smith and Waterman algorithm, where gap creation 
20 penalty equals 8 and gap extension penalty equals 2. 

The polypeptide according to this aspect of the present invention is 
preferably encoded by a polynucleotide which is hybridizable with SEQ ID 
NOs:l, 4 or 6 or a portion thereof under mild or stringent hybridization 
conditions as described above. 
25 To effect expression, this polynucleotide sequence is preferably 

included in nucleic acid construct which also includes a promoter and selection 
markers as described hereinabove. 

It will be appreciated that any of the transformation methods described 
hereinabove can be used to transform a plant or plant tissues with the above 
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described construct, such thai expression of the isolated nucleic acid according 
to any aspect of the present invention is effected. 

Hybridization under mild hybridization conditions is effected by a 
hybridization solution containing 10 % dextrane sulfate, 1 M NaCl, 1 % SDS 
5 and 5 x 1 06 cpm % labeled probe, at 65 °C, with a final wash solution of 2 x 
SSC and 0.1 % SDS and final wash at 48 °C. 

According to another aspect of the present invention there is provided a 
method for determining whether fruits to be produced from solanaccae seeds 
or solanaeeae seedling will contain an amount of monosaccharides above a 
10 predetermined threshold. 

Thus, the present invention describes novel genes encoding apoplastic 
invertase isoenzymes which function cither in combination or individually in 
elevating the monosaccharide of plants expressing same. 

A promoter sequence controlling the expression of Lin5 is also within 
15 the scope of the present invention. Such a promoter resides upstream to Lin5 
and its sequence is included in SEQ ID NO:l (nucleotides 1-4849). 

Additional objects, advantages, and novel features of the present 
invention will become apparent to one ordinarily skilled in the art upon 
examination of the following examples, which are not intended to be limiting. 
20 Additionally, each of the various embodiments and aspects of the present 
invention as delineated hereinabove and as claimed in the claims section below 
finds experimental support in the following examples. 

EXAMPLES 

25 Reference is now made to the following examples, which together with 

the above descriptions, illustrate the invention in a non limiting fashion. 

Generally, the nomenclature used herein and the laboratory procedures 
utilized in the present invention include molecular, biochemical, 
microbiological and recombinant DNA techniques. Such techniques are 
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thoroughly explained in the literature. See, for example, "Molecular Cloning: 
A laboratory Manual" Sambrook ct at., (1989); "Current Protocols in 
Molecular Biology" Volumes Mil Ausubel, R. M., ed. (1994); Ausubel et al., 
"Current Protocols in Molecular Biology", John Wiley and Sons, Baltimore, 

5 Maryland (1989); Herbal, "A Practical Guide to Molecular Cloning", John 
Wiley & Sons, New York (1988); Watson et al., "Recombinant DNA", 
Scientific American Books, New York; Birrcn et al. (eds) "Genome Analysis: 
A Laboratory Manual Series", Vols. 1-4, Cold Spring Harbor Laboratory 
Press, New York (1998); "Cell Biology: A Laboratory Handbook", Volumes I- 

io III Cellis, J. K., ed. (1 994); "Oligonucleotide Synthesis" Gait, M. J., ed. (1984); 
"Nucleic Acid Hybridization" Hames, B. D., and Higgins S. J., eds. (1985); 
"Transcription and Translation" Hamcs, B. D., and Higgins S. J., eds. (1984); 
"A Practical Guide to Molecular Cloning" Perbal, B., (1984) and "Methods in 
Enzymology" Vol. 1-317, Academic Press; "PCR Protocols: A Guide To 

15 Methods And Applications", Academic Press, San Diego, CA (1990); Marshak 
et al.. "Strategies for Protein Purification and Characterization - A Laboratory 
Course Manual" CSHL Press (1996); "An introduction to genetic analysis"- 
third edition, Suzuki et al., 1986 and "Molecular Dissection of Complex Trails, 
Paterson AH 1998; all of which are incorporated by reference as if fully set 

20 forth herein. Other general references are provided throughout this document. 
The procedures therein are believed lo be well known in the art and are 
provided for the convenience of the reader. All the information contained 
therein is incorporated herein by reference. 

25 EXAMPLE 1 

In previously published results (Eshed and Zamir, 1995, ibid) an L. 
pennellii introgrcssion line (IL) population was designed in order to generate 
QTL-N1I.S. This II - population consisted of 50 lines, each containing a single 
homozygous restriction fragment length polymorphism (RFLP)-defined wild- 

30 species chromosome segment. Together these lines provided complete 
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coverage of the toraato genome and a set of nearly isogenic lines (NILs) to 
their recurrent parent, the processing-tomato cultivar M82 (Rick ct al., TGRC 
stock lists, Rep. Tom. Genet Coop., AS, 53, 1995) (Figure 1). The genetic 
assumption underlying the identification of QTL using the NILs was that any 
phenotypic difference between an IL and its nearly isogenic control plant is 
due to a QTL that resides on die chromosome segment introgrcssed from L. 
pennellii. The minimum number of P <0.05-significant QTL affecting a trait in 
the lLs was calculated on the basis of the following assumptions: (i) each IL 
affecting a quantitative trait carries only a single QTL; and (ii) two 
overlapping introgresstons with a significant effect on a trait (in the same 
direction relative to the control) carry the same QTL. 

Therefore, in the lLs, the maximum number of detectable QTLs is 
approximately 30. Despite this limitation, twice as many QTLs responsible for 
fruit mass (FM) were identified as compared to the previous populations (see 
Table 1 below). The sensitivity of the ILs in identifying QTLs was even more 
pronounced for brix (B), where two to six times as many QTLs were identified 
as compared to the other populations. 



Table J 

The number of significant effects (p<0.05) of wild species QTLs on FM 



B 



Species 


population 
stnwture 


population 
size 


No- of 
FM-QTLs 


No. of 
B-OTLs 


Reference 


I., chmielewskii 


BC1 


237 


6 


4 


Tunbitey ti al. 
Gentries. 
232. 1141. 1992 


L. cheesmanil 


F2 


350 


7 


4 


Genetics. 
127.181.1991 


L. 

pimpinclli folium 


BC1 


257 


7 


3 


Grondittoctal 
Theor. Appl Centt., 
90.225. 1996 


L, cheesmanii 


RI 


97(6 reps.) 


12 


14 


Goldman ct al 
Thvor.Appl. Genet. , 
90. 925. 1995 


L. pennellii 


IL 


50 (6 reps-) 


18 


23 


Kshed and Zvnir 
1995. ibid 



Using the L. pennellii ILs, QTLs were mapped to various chromosome 
segments originating from the wild species. However the effects associated 
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with an introgressed segment could be due to the existence of one or more loci. 

A 60-cM segment on the long arm of chromosome 2 was responsible 
for a 60 % reduction in FM (in homozygoles) relative to the control, M82. 
This chromosomal region apparently harbors QTLs responsible for FM, which 
are common to a number of wild tomato species (Alpert ct al. Theor. Appl. 

Genet., 91, 994. 1995). 

Fine-mapping analysis of recombinant lines for diat region identified 
three linked loci with a similar effect on FM; two of which were placed on a 3 
cM interval. Finer mapping may reveal additional FM QTL in these regions. 

Quantitative effects which appear to be associated with a single locus 
were inferred from cases of rare transgressive segregation. Using (he ILs, 18 
QTLs responsible for FM were identified but in only two cases (1L7-5 and 
TL12-1-1 with introgression sizes of 15 and 4 cM, respectively) alleles of the 
small-fruited wild species were associated with larger fruits. These effects 
were consistent in trials conducted in different years and genetic constitutions 
(Eshed and Zamir, 1996, ibid). 

Several features of the 1L population contributed to its efficiency in 
detecting QTLs, even in cases when only a tew replicates of each genotype 
were evaluated. 

(i) The lines contained single RFLP-defined introgressions, some of 
which produce effects of relatively large magnitude in which most of the 
phenotypic variation between the Nil .8 is associated with the introgressed 
segment. 

(ii) The permanent nature of the lines enabled testing of the 
introgression effects in different years. The results obtained showed high 
reproducibility of the effects of the QTL which were mapped to the different 
introgressed chromosome segments. 

(iii) Elimination of the "overshadowing effect" of major QTLs 
enabled to detect minor QTLs (a major QTL contributes to large phenotypic 
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variation, thereby masking the effects of other QTLs segregating in the same 
population) 

(iv) Elimination of epistatic interactions between unlinked QTLs. 

(v) The simple statistical procedure relied on comparison with a 
5 common control and is therefore less affected by experimental error. 

Gene actions revealed by QTL studies 
A gene action of the QTL detected was determined using the IL 
population described above by comparing the homozygous lLs to hybrids of 

10 the ILs with the recurrent parent. 

The FM and brix qualities were determined by QTLs which were 
intermediates between additivity and dominance. This mode of inheritance is 
in agreement with results obtained by analysis of an F2 generation (Paterson 
Genetics, 127, 1R1, 1991). In contrast, fruit yield (Y) was strongly associated 

15 with overdominance, whereby some of the heterozygous ILs had higher values 
relative to their corresponding homozygous parents. 

Detailed mapping analysis of a chromosome 1 introgression which 
showed overdominance for Y suggested the existence of two eis loci with 
opposing effects. This result was therefore consistent with the pseudo- 

20 overdominance model for heterosis (Crow Heterosis, Gowen, J. W., Eds., Iowa 
State College Press, Ames, 1A, 1952). However for the other heterotic 
introgrcssions, including rfw-i, the issue of the mode of gene action for 
heterosis is still unresolved. It is interesting to note that the wild species used 
for the tomato mapping studies were highly inferior to the cultivated variety 

25 with respect to Y, yet chromosome segments from these species contribute to 
the increased Y of commercially grown varieties. This transgressive 
segregation is frequent for Y and for seedling morphological traits, whereas 
for FM and brix, transgression was rare (De- Vicente et al Genetics, 134, 585, 
1993). 

30 



13/03 02 WED 13:25 FAX 072 3 6127575 G. E. EHRL1CH X0D709E3 «0108fifl 




30 



Reproducibility of the effects of an identified QTL 
Mendelian factors underlying quantitative traits in an interspecific 
tomato cross were compared in F2 and F3 generations of the same population 
(Paterson Genetics, 127, 181. 1991). Of 11 FM QTL identified in both 
s generations in a trial conducted in California, six were significant both in F2 
and F3. Or the five B QTLs, two were significant in both generations. 
Differences between generations can result from interactions with the 
environment and/or may indicate that the resolution power of such populations 
is limited to QTLs with large effects, fn contrast, of 33 yield-associated QTLs 
,o identified in a two-year trial of selected ILs, 28 were significant in both 
experiments (Eshed and Zamir 1996, ibid). 

Association between QTL-NILs and the inirogressed segment 
The use of the L pennelUi TLs to identify QTLs is based on RFLP 
results which indicated that each line contains a single wild-species 
is introgression. However, some of the lines may include small unidentified 
inlrogressions, and these segments may be responsible for the observed 
phenotypic effects. To test whether the difference between the 1L and its nearly 
isogenic control lies solely in the introgressed segment, a simple experiment 
was performed using eight selected ILs. An F2 resulting from a cross between 
20 each IL and M82 was subjected to RFLP analysis, and plants homozygous for 
the cultivated-tomato chromosome segment were compared quantitatively to 
M82. In no case were any differences detected, indicating that the observed 
phenotypic differences (which were verified using the plants carrying the I. 
pennellii introgrcssions) are due to the mapped chromosome segment. 

25 

EXAMPLE 2 
Epistatic interactions 
The study described in Example 1 (Eshed and Zamir, 1995, ibid) served 
as a basis for testing epistatic interactions between QTLs. Thus, 10 ILs were 
30 selected, some of which include QTLs that affect the measured traits in the 
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heterozygous condition in various directions relative to the control the results 
obtained were reported. 

The ten homozygous lLs were crossed in a half diallele mode and the 
phenotypic values of the 45 double heterozygotes were compared to the 
respective single heterozygous ILs and M82. The results which were 
previously reported by Eshed and Zamir (1996, ibid) indicate that QTL 
cpistasis is prevalent and is generally less than additive. 

Phenotype of the selected single introgression ILHs: 
In the complete IL population composed of 50 introgression line 
hybrids (ILHs) which was analyzed for five yicld-associaied traits, 81 of the 
250 ILH x trait combinations (32 %) were significantly different from the 
isogenic controls (p<0.05). For the subset of the 10 ILHs selected for the 
interaction study, 30 of the 50 combinations (60 %) differed significantly from 
the control (Figure 2). This comparison indicates that the 10 ILHs were 
enriched for QTLs affecting the measured traits. 

In this previously reported study, of the 10 ILHs (using the same 
experimental error), 28 of the 30 significant effects were consistent between 
the two experiments (Table 2 below; Y for ILH1-4 and BY for ILH2-6-I were 
not significantly different from the control). 

Table 2 

Mean phenotypic values ofMB2 and the IL hybrids heterozygous for single 

introgressions 



Gcnotyp 
e 


lnlrogressed 
region 'a' 


Number 
of 

replicates 


plain 

weigln 

(tad 


fruit 
mass 
(S) 


brix (0) 


Yield 


brixx 
yield 


M82 


none 


79 


1.82+ 
0.44 


56. I± 
4.9 


4.54± 
0.40 


9.18+ 
1.54 


417± 
82 


II,HM b 


1(CT233-TG71; 
58 cM) 


26 


3.56± 
0.84* 


48.6+ 
6.5* 


5.23+ 
0.48* 


IUS± 
2.34* 


580± 

114* I 


Ti.Hl-4 


1(TG245-TG259; 
35 cM) 


23 


2I0± 
0.44 


58.1* 
5.4 


5.16± 
0.39* 


9.84± 
1.60 


507* 
83 


ILH2-1 


2<R45S- TG276; 
J6cM) 


26 


1.27± 
0.32* 


52.4± 
5.4* 


4.01± 
0.33* 


7.I9± 
1.64* 


289± 
72' 


ILH2-6- 


2(TG91-CT59: 
14cM 


26 


2.68± 
0.46* 


35.2± 
4.5* 


5.30+ 
0.44* 


8.95* 
1.81 


474± 
99 
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TG413; !6cM) 


25 


2.62± 
0.59* 


57.91 
7.1 


5.071 
0.31* 


10.851 
2.34* 


5511 
J.27* 


Tt H7-S 
iLn i J 


7(TG61- 

T0131A; 15 cM) 


26 


2.46± 
0.46* 


61.5+ 
6.3* 


4.83+ 
0-33* 


10.521 
1.45* 


5091 
87* 


ILH9-2- 

5 C 


9( CT283A- 
TG10;9cM) 


25 


2.091 
0.47 


51.71 
6.3* 


5.521 
0.26* 


9.58+ 
1.92 


5321 
122* 


ILH10- 
1 


10( TG230- 
TG285; 37cM) 


24 


1.84+ 
0.33 


46.51 
5.5* 


5.111 
0.3 1* 


8 f 38i 
1.60 


4281 
81 


ILH11- 
1 


1 l(TG497- 

TG523;27cM) 


26 


2.06± 
0.47 


47.5+- 
3.6* 


4.79+ 
0.41 


8.50 
+1.65 


4061 
21 


ILH12- 


12(TU18U- 
ACO-l;4cM) 


27 


1.81± 
i 0.32 


63 3± 
5.4* 


4.70+ 
0.37 


8.96 
±1.21 


4221 
69 



Mean pnentnypie vumc* cum buuiiuuu wviovv^ vi — ~* r 1 

crosses. All means were compared to M82 and the ones marked wilh * are significantly different 
(Dunnci's t-test, p<0.05). Underlined mean values indicate a significant interaction with year (1993 vs., 
1995; 0.0 Kp<0.05). , 
5 - The introgressed regions in the lLHs is indicated by chromosome number, the markers flunking the 
jntfogression and its size in cM according to Tanskley et al. (1 992) Genetics 1 32: 11 4 1 - 1 1 60. 

- 1LH - Hybrid of ILs crossed with M82. 
C - Interaction with year was based on unpublished results from a 1994 trial. 

io The effects of ILH7-5 on PW, FM and brix were found to be significant 

as compared to other previously reported studies, this significance was 
probably due to (he larger number of replicants tested (25 as compared to 6 in 
previous studies). Significant 1LH by year interactions (p<0.05) were detected 
for four of the 50 comparisons (Table 2), These four comparisons were not 

15 significantly different from the control in either of the years. These results 
indicate a high overall reproducibility of the experimental system in different 
years of growth. 

Interactions between unlinked intr agressions: 

The null hypothesis for the interaction analysis was complete additivity 

20 of the effects of the single tntrogression lLHs. Any significant deviation from 
complete additivity was considered as epistasis (Figure 3). 

For example, ILH1-1 increased PW by 95 % compared to M82; 1LH12- 
1-1 reduced PW by 1 % compared to M82. The expected phenotype for the 
hybrid between the two homozygous ILs (ILM and IL12-1-1) is a 94% 

25 increase in PW relative to M82. The observed PW for the hybrid heterozygous 
for the two inlrogressions was 76 % higher than M82, indicating a significant 
interaction (p<0.05). 
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Of the 225 possible interactions (45 hybrids x five traits) 59, 28 and 12 
were significant at the p<0.05, p<0.01 and p<0.001 level respectively (Figure 
3). These values are much higher thaw that expected by chance alone. 

To further characterize the nature of the interactions, the double- 
heterozygous combinations were divided into four groups based on the 
performance of the single ILHs ( Table 3 below). 

Table 3 

Frequency of significant interactions (p<0.05) between unlinked L. pennellii 

introgressions 



20 



25 



Interacting^ 
OIL types 



Sig-Sig (same 
direction) 



Sig-Sig (opposite 
direct i on) 



Sig*NonSi& 



NonSlg-NonSlg. 



' a QTLs were classified accordineio the significance and ihc direction of their effects relative to MS2. 
k Number of significant interactions. 

c Number of tested combinations of two L pennetiii introgressions. 

As is shown by Table 3, of 61 tested introgressions between significant 
QT1 ,s (same direction), 29 (48 %) were significant (p<0.05) indicating that the 
interactions between two significant QTLs of L pennellii affect a trait in the 
same direction. 

Among 32 introgressions between significant (opposite) QTLs, seven 
(22 %) significant interactions were detected indicating that the interaction 
between two significant QTLs of L pennellii affect the trait in opposite 
directions. Six of these interactions involved crosses with 1L2-1 for PW, B and 
BY. The IL2-1 line carries the plciotropic QTL which affected all of these 
traits. The seventh interaction in this group involved that of IL12-1-1 with 1L- 
10-1 for FM, where 11)2-1-1 showed marked transgressive segregation for 
this trait ( Table 2). 



Flam 



3*76 



2/4 



5/25 



1/10 



11/45 



Fruit 
Mass 



8/16 



1/12 



2/16 



0/1 



brix 



12/21 



3/7 



4/16 



0/1 



11/45 



19/45 



Total 
fruit yield 



1/3 



0/3 



5/24 



3/15 



brixxyicld 



5/15 



1/6 



2/21 



1/3 



9/45 



I 9/45 



Sum 



29/61 



7/32 



18/102 



5/30 



59/225 
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Among 102 introgressions between significant and non-significant 

QTLs 18 (18 %) interactions were significant. 

Among the 30 introgressions between non significant QTLs, five (17 

%) significant interactions were found. Overall, 26 % (59/225) of the various 
5 L. pennellii introgressions showed significant interactions and the proportion 

of epistatic effects was highest for significant same direction QTLs. 

To search for general trends in the interaction of QTLs, the observed 

values of the 45 double-heterozygous hybrids were plotted against their 

expected values* (Figure 4). 
10 For all five traits highly significani linear regressions were found, 

indicating the overall additivity of the effects of the independent 

introgressions. Assuming complete additivity between the effects of the 

combined individual introgressions one would expect a regression with a slope 

of 1 . The slopes of the lines for the five traits were significantly lower than 1 
is (ranging from 0.71-0.79), indicating average combined effects which are less 

than additive. 

To further examine the less than additive tend revealed by the 
regression analysis, only the cases of cpistasis between significant QTL 
affecting the traits in the same direction were examined irrespective of whether 

20 the QTL originated from L pennellii or L esculentum. 

Twenty-nine epistatic interactions between L. pennellii introgressions 
were detected. In all cases, the observed means for the double heterozygous 
ILHs were significantly lower than the values expected on the basis of an 
assumption of complete additivity. Seven of the interactions of QTL affecting 

25 the trait in the same direction involved L pennellii introgressions with L. 
esculenium alleles. In these cases (row 2 of Table 3), the L. pennellii 
introgressions affected the trait in an opposite manner to that expected 
according to the parental phenotype (transgressive QTL). Six of the seven 
interactions were less than additive; the only exception was PW for the hybrid 

30 of 1M-1 x IL2-1. In this case the double heterozygous hybrid for the QTL 
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acting in the same direction (ILH1-1) showed a higher mean value than the 
sum of the Lwo independent Q'lT (M82 and IL2-1 x IL1-1). Overall, 35 of the 
36 interactions (97 %) showed less than additive interactions. 



10 



15 



Interactions between linked QTL (chromosome 2): 

Twelve homozygous ILs with different introgression sizes in 
chromosome 2 were evaluated for FM and B. Since 10 of these lines were 
previously tested (Eshed and Zamir 1995, ibid) and no significant interactions 
between year and IL were detected, the results from the two years were pooled. 
Based on the overlapping recombined chromosome segments and the 
phenotypic value of each of the ILs, two B QTL and three FM QTL, 
responsible for a similar reduction in fruit mass, were mapped (Figure 5-6). 
After determining the positions of these QTL, the lines were classified 
according to their postulated genotypes (Table 4). 

Table 4 



Gcnotypic group 3 


Mean brix (B) 
in Brix units 


Mean brix 

A % from M82 


P value of 
interaction 


No QTL b 


4.47 


-0.2 




B2-1 


5.00 


J 1.7 




B2-2 


4.98 


11.6 




B2-1/2-2 


5.37 


1P.9 


0.03 


Genotypic group* 


Mean Fruit mass (1*M) 
in grams 


Mean h'M 

A % from M82 


V value of 
interaction 


No QTL D 


59.5 


0.6 




Fm2-1 


43.0 


-27.3 




Fm2-2 


41.0 


-30.7 




Fm2-3 


42,7 


-27.8 




Fm2- 1/2-2 


30.7 


-48.2 


0.009 


Fm2-2-/2-3 


28.0 


-52.7 


0.03 


Fm2.1/2-2/2-3 


21.0 


-64.6 


<0.0001/<0.0001 c 



0 M82 was included in this group, which includes lines without an L. peuneliii QTL which affects this 
20 trait. 

c The two tested interactions were Hm2-lXFm2-2/2-3 and Fm2-3xFm2-l/2-2 



Epistasis for B and FM was tested by comparing the means of the 
pooled genotypic groups. The single interaction for B was significant and the 
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sum of the effects of the single QTL was higher than the mean value of the 
lines carrying both QTL. The four different tests for FM QTL interactions 
were significant: two of them examined the combined action of a single QTL 
and two examined a single QTL and the remaining pair. The average 

5 diminishing effect for two QTL was 8.5 % compared to 16.2 % for interactions 
involving the three QTL (Table 3). This result suggests that the effect of the 
less than additive epistasis is increased (i.e. the effects are further diminished) 
when more QTL are involved. 

The nearly isogenic nature of the TL population utilized by this study 

io allows the identification of twice as many QTI affecting FM and B as in other 
interspecific studies in tomato (Eshed and Zamir, 1995, ibid). The isogenic 
nature of the 1L population is also responsible for the ability to determine 
epistasis of QTL through the design of experiments with balanced 
representation of the different genotypes. Nearly isogenic lines were 

15 previously demonstrated to be very efficient for the detection of epistasis of 
QTL in Drosophila (Long et al 1995. Genetics 139:1273-1291) and maize 
(Doeblcy et al. 1995, Genetics 141: 333-346). In conventional segregating 
populations (F2/F3, BC and recombinant inbreds) all the QTLs which affect 
the trait are segregating QTLs. Assuming that the less than additive mode of 

20 epistasis detected in this study is common to other tomato crosses, this 
interaction would reduce the effect of individual QTLs. As a consequence, the 
number of significant QTLs would be underestimated. Less than additive 
interactions among QTL ensure that the "loss" of an allele affecting a fitness 
trait will have a minimal effect on the phenotype and that canalization will be 

25 achieved. 

Contrary to past QTL mapping studies that uncovered little evidence for 
epistasis, QTL epistasis is an important component in determining the 
phenotypic value for traits showing continuous variation (Table 3). Of the 93 
combinations of pairs of significant QTLs, 39 % were epistatic at a 
30 significance level of p<0.05. Moreover, a higher frequency of epistasis than 
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expected by chance alone was detected for L pennellii chromosome segments 
that individually did not affect the traits (17 %). 

Thus, the prevalence of epistasis uncovered by this study is consistent 
with the numerous classical studies of quantitative traits and breeding that 
5 show significant overall epistatic effects for quantitative traits detected through 
biometrical genetics. 

EXAMPLE 3 

Separating the positive trait for high brix value from the negative traits of 
io percentage green fruit yield and internodes length through marker assisted 

selection 

As is described in Examples 1 and 2, the hybrid plants obtained from 
introgressing L pennellii into a L. esculentum genetic background detected 
numerous QTLs associated with traits such as brix (B) and fruit mass (FM). 

15 However, these studies failed to isolate the QTL associated with brix from 
other QTLs which are associated with negative traits such as high percentage 
of green fruit yield and long internodes. 

As such, while reducing the present invention to practice a hybrid plant 
(IL9-2-5) resultant from these studies was further introgressed into the genetic 

20 background of an L. esculentum cultivated tomato variety (M82) in efforts to 
isolate the QTL associated with brix from other QTLs responsible for these 
negative traits which are present in IL9-2-5, to thereby obtain a plant line 
bearing fruits characterized by a high sugar content (high brix value) while 
being otherwise si mi far in phenotype to a cultivated tomato. 

25 To estimate the phenotypic variation associated with high brix value, 

near isogenic plants derived from self crossing of M82 and IL9-2-5 and 
hybrids generated from crossing M82 and IL9-2-5, were evaluated over a three 
year period. Figure 7 presents the means of the lested genotypes for total 
soluble-solids (brix, B), plant weight (PW) and fruit mass (FM). 



13/03 02 WED 15:27 FAX 972 3 6127575 



G. E. EHRLICH 



10070923 a O!Olfi 



• 



38 



The 9-2-5 (chromosome 9) inlrogrcssion was responsible for a 
significant reduction (10 %) in FM in 1995 while in the following years its 
effects were not significant. The effect of the introgression on B was 
consistent between the different years; the introgression significantly increased 

5 B from 20 to 32 percent relative to the control showing partial dominance 
(d/a=0.64). In 1995, the introgression increased PW by ten percent compared 
to 70 percent in 1997; yet, the effect of the introgression on B was similar in 
these two years, indicating that PW is not involved in the major pathway 
affecting B. Hybrid high brix value plants that carried the introgression were 

10 more vegetative, with longer internodes and the ripening of the fruit lasted a 
longer period (late variety) and as such are of little commercial value. 

In order to generate hybrids characterized by a uniform and early 
ripening, a good cover of the fruit and a high brix value, which hybrids are of 
high commercial value, it was decided that further narrowing of the 9-2-5 

15 introgression chromosomal region (9 cM) must be effected in order to isolate 
the brix QTL. 

In order to verify if the increase in brix and the negative effects 
described above which are associated with the 9-2-5 introgression are due to 
linkage drag or are simply pleiotropic effects of the QTL, sub-lines of TL9-2-5 

20 (IL9-2-6 and 1L9-2-7) were generated by selfing the IL9-2-5 hybrid and 
screening for recombinants in the introgression. IL9-2-6 and IL9-2-7 carried 
the south (in direction of the centromere) and north (in direction of a telomere) 
part of the introgression, respectively (Figure 8a). Plants of M82 and hybrids 
generated from crossing M82 with the JL9-2-5, IL9-2-6 and 1L9-2-7 plant lines 

25 (termed ILH9-2-5, TLH9-2-6 and ILH9-2-7, respectively) were planted in a 
commercial stand and evaluated for B, FM, vegetation and % of green fruit 
yield as a parameter for the uniformity of the ripening. Figure 8b presents the 
mean effects of the tested hybrids as is compared to the control tomato plant 
M82. The short introgrcssions of IL9-2-7 showed the "negative" phenotype of 

30 IL9-2-5 with high vegetation, longer internodes and late maturity, but had no 
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significant effect on B. IL9-2-6 had a significant increasing effect on B with a 
reduced vegetation and an early and uniform ripening. The three hybrids had 
no significant effects on the FM. 

Thus, these results place the brix QTL in the south part of the 9-2-5 
s introgression (Figure 8a). A hybrid plant (IT.H9-2-6) generated from 
introgressing IL9-2-6 in the M82 genetic background is characterized by fruit 
having an increased sugar content (B) similar to that of the IL9-2-5 hybrid 
plant line, without the undesired trails found in TL9-2-S which are generated by 
genes situated in the northern part of the 9-2-5 introgression (9-2-7). 

10 

EXAMPLE 4 

The study described in Example 3 which was conducted as part of the 
present invention and previously published studies described in Examples 1 
and 2, were performed in a genetic background of determinate tomato lines 

15 that were specifically developed for the processing tomato industry (M82). 
These plants are suitable for -once over" machine harvest due to homozygosity 
for the recessive mutation sp (self pruning) which modifies the developmental 
program of the shoot such that growth is terminated after the production of two 
consecutive inflorescences. 

20 The wild species (green-fruited) and greenhouse cultivated tomatoes are 

indeterminate (Sp') where the shoot follows a uniform developmental program 
of three leaves and an inflorescence throughout the growth (Pnueli ct al. 1998, 
Development 125:1979-1989). Indeterminate greenhouse tomatoes require 
different agricultural practices as is compared to determinate varieties and 

25 therefore constitute a fundamentally different genetic background to test the 
effect of the brix QTL. 

M82 and IL9-2-5 were crossed with an indeterminate greenhouse line 
(202) and the two nearly isogenic indeterminate hybrids were grown in the 
greenhouse and evaluated for B. The introgression was responsible for a 40 

30 percent increase in B with a separation of the values into discrete groups 
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(Figure 9). This result gave a motivation to develop NILs for the chromosome 
9 introgression in Ihe genetic background of line 202. 

The initial material for the introduction of the brix QTL into 
indeterminate background was the IL9-2-4 introgression line (Figure 10). l*bis 

5 introgression extends to the south of the chromosome beyond the 9-2-5 
introgression. This line was selected since it was observed (hat recombinants 
are more efficiently obtained when long introgressions are used in the marker 
assisted selection. After five marker-assisted backcrosses the selfed 
generation of a BC5 plant that was heterozygous for the introgression was 

io grown and the segregating population was subjected to RFLP analysis. The 
results were highly consistent between the determinate and indeterminate 
backgrounds (Figure 11), the homozygous NIL, containing segment of 
chromosome 9, improved B by 27 % over the control with partial dominance 
for the wild species segment (a=0.5, d=0.25, d/a=0.5). Very similar results 

15 were obtained in another growing season (data not presented) confirming that 
the observed effects were independent of environment in the greenhouse. 

Thus, a major brix associated QTL was introgressed into a genetic 
background of an indeterminate greenhouse tomato (202) thus yielding plants 
which are high in brix and which in all other aspects are similar in phenotype 

70 to this indeterminant greenhouse tomato line. In addition, the resultant tomato 
line does not display the undesirable self pruning trait inherent to determinate 
tomato lines specifically developed for the processing tomato industry (M82). 



EXAMPLES 

25 Thus, marker and phenotype assisted introgression studies revealed the 

exisfence of a single chromosome region which includes the high Brix QTL of 
green-fruited tomato fruits. The high Brix QTL (termed Z?r/x9-2-5) was found 
to be associated with the centromeric portion of chromosome 9. 

In order to isolate the gene or genes responsible for this phenotype 

30 further studies were conducted. 
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Materials and Methods: 

Plant material: 

The nearly isogenic lines (NJULs) for the open-field trial were planted in 
Alcko, Israel (14-28 plants per NIL) in a completely randomized pattern. 
5 Agricultural practices and phenotypic measurements were described previously 
(Eshed and Zamir 1995, ibid). Glasshouse trials of the segregating 
recombinant families were conducted in Shekef, Israel during 1997 and 1998 
in a completely randomized design. 

Statistical analysis: 

io Statistical analysis was performed with the JMJP V.3,1 software for 

Macintosh. Mean brix values were compared using the "Fit Y by X" function 
and "Compare with control" with an alpha level of 0.001 (Dunnet, 1955, J, 
Am. Stat. Assoc. 50, 1096-1121). The control phenotypic values were 
obtained using cv. M82 for the open-field trials (Figure 12) and with the 

15 indeterminate line 17 for the glasshouse trials (Figure 1 1). The additive effect 
(a), dominance deviation (d) were calculated as described above in Example 4. 
Mapping of Brix9-2-5 using the recombinant families was done by RFLP 
genotyping and a two-step analysis. In each recombinant family the brix 
phenotypic value for the L. pennellii homozygotcs, was compared to that of 

20 line 17 and expressed as a percentage of the control (Figure 14a). 
Recombinant families containing a common marker-defined L. pennellii 
chromosome segment were grouped and the mean phenotypic effects for the 
groups were calculated (Figures 13 and 14a). 
Nucleic acid analysis: 

25 The different segregating populations were subjected to RFLP analysis 

as previously reported (Eshed and Zamir 1996, ibid). A bacterial artificial 
chromosome designated BAC91A4 was isolated, subcloned, sequenced and 
assembled by the Sequencer software package. The nucleic acid sequence of 
the 13 Kb insert of BAC91A4 is presented in SEQ ID NO:l. DNA of the 

so homozygous recombinants was used as a substrate for PCR, using the primers 
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5 r -TlTGGGCTCATTCAGTCTCA-3' (SEQ ID N0:2) and 5'- 
AAATTGTTCGGCCTCGTT-3 1 (SEQ ID N0:3) in order to amplify a 1,200 
bp portion of the LinS gene (Figure 14b). The PCR products were cloned and 
sequenced using the pGEM-T easy vector by Promega. PCR was performed 
5 using PCR Supermix (Life Technologies) with 35 cycles of 30 sec at 94 °C, 30 
sec at 52 °C and 1 min at 68 °C, followed by 30 min at 68 °C. 

Experimental Results 
For fine mapping of Br ix9-2-5, 7,000 F2 progenies of the NILs hybrids 

10 (described under Example 4) were subjected to RFLP analysis. Of 145 
recombinants identified between the CP44 and TG225 markers (Figure 13), 29 
were further localized between the two ends of a BAC clone (BAC91A4) 
(Figure 13). For each of the 29 recombinant families, 48 sclfcd progenies 
were genotyped with the appropriate segregating markers and analyzed for 

15 brix. On the basis of common introgressed segments in the 29 recombinant 
families, six recombination groups were generated (Figure 13). Group al 
included seven families that contained the L. pennellii segment north of 91H6, 
none of which showed a significant effect on brix. The reciprocal 
recombination group a2 contained the L pennellii segment south of pi 4 and 

20 showed a significant increase in brix. The a groups placed the QTL south of 
91H6. Using the same procedure, groups p and y located Brix9~2-5 between 
H14 and pl4. To narrow the position of Brix9-2-5 7 18-Kb spanning pi 4 and 
H14 was sequenced and used to design different primer pairs that amplified 
polymorphic products (in size or restriction pattern) between the parental lines. 

25 These products were genetically mapped using the 29 recombinants and one of 
these PCR markers (F8785, Figure 13) co-segregated with the brix QTL. This 
1 Kb genomic interval, represented by F8785, was sequenced in both the 
parental types and the recombinants. Based on nucleotide polymorphisms 
(NPs), 13 families were shown to be recombinants within this 1-Kb fragment. 
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The phenotypic effects for each of the 13 families were used to determine the 
location of Brix9-2-S on the NPs map (Figure 14b, SEQ TD NO:4). 
Recombinants 3, 13 and 6 delimited the Brix9-2-S to a regioii south of position 
2324 (Figure 14b) in a manner consistent with the mapping of the rest of the 
5 recombinant families to the north. Recombinant 2 delimited the QTL to the 
region north of position 2808 (Figure 14b, SEQ ID NO:4), a conclusion thai is 
in agreement with the mapping of recombinants 5 (a member of group pi; 
Figure 13) and 29. This NPs mapping positioned the Brix9-2-5 between 
positions 2324 and 2786 of SEQ ID NO:4. 

10 A GeneBank search revealed that the terminal portions of the brix QTL 

484-bp interval contained regions encoding Lycopersicon apoplastic invertase 
(LinS GeneBank Accession number X9I389) which is expressed exclusively in 
flowers and fruits (Godt el al, 1997 Plant Physiol. 115, 273-282) and for 
which a complete nucleic acid sequence is yet to be determined. 

15 A comparison of the genomic DNA sequenced in the present study 

(SEQ ID NO:l) and the cDNA sequence of LinS resolved the genomic 
sequence of LinS (SEQ TD NO:4) which includes six exons that encode the 
invertase protein (SEQ ID NO: 5). The 484-bp interval spans a 3* portion of 
exon 3, intron 3 and the 5 l portion of exon 4 (Figure 14b). LinS is a member of 

20 a small family of genes encoding apoplastic invertases which irreversibly 
cleave sucrose into glucose and fructose. In most plant species, assimilated 
carbon is transported as sucrose. The extracellular inverlases maintain a 
gradient of carbohydrates, from the source parts of the plant, to the developing 
sink tissues. In addition to sucrose hydrolysis, invertase plays a central role in 

25 regulating, amplifying, and integrating different signals that lead to source- 
sink transition (Roitsch, 1999, Curr. Opin. Plant Biol. 2, 198-206). The 
activity of this enzyme changes the sugar influx, and thus alters the expression 
of sugar-responsive genes in a manner that is yet unclear (Sturm and Tang, 
1999, Trends Plant Sci. 4,401-407). 
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The proposed cDNA sequence of the LinS invertase from L. 
esculeniuml (SEQ ID NO:6) is identical in it f s 3' region to the partial LinS 
cDNA sequence. In addition a cDNA library from L pennellii was screened 
and the full length pLin5 cDNA clone was isolated and sequenced. 

The novel invertase gene associated with the brix QTL which was 
isolated as part of this study was designated pLinS. The proposed cDNA 
sequence of this gene displayed a high degree of homology (identity) (977 %) 
between the L. pennelli and L esculentum species, while homology to other 
isolated invertase cDN As was less than 76 % (Table 8). 

Tables 5 below details the nucleotide coordinates for the various 
regions in the eLinS gene (numbers refer to SEQ ID NO:l). 

Table S 

Nucleotide coordinates of the various regions in the LinS Gene 



Nucleotides 


Description 


comments 


i-4849 


Lin5 promoter 




4850 - 5048 


LinS exon I 


4850 - start codon (#1 in the eLinS 
sequence, SEQ ID NOs:4 and 6) 


5049-6332 


Lin5 Intron 1 




6333 -6341 


Lin5 exon 2 


Conserved in plants 


6342-6418 


Lin5 intron 2 




6419-7440 


LinS exon 3 




7441 -7619 


LinS intron 3 


Inc1udea30aaORFin L 


7620-7864 


LinS exon 4 




7865 - 8054 


LinS intron 4 




8055- 8154 


LinS exon 5 




8155- 8285 


LinS intron 5 




8286 - 8670 


LinS exon 6 


8463 - stop codon 



Using blastn (Entrez) the full sequences of the L pennellii pi ,in5 (SEQ 
ID NO:6) were compared to non-redundant (nr) and expressed sequence tag 
(est) libraries. Homologous sequences are presented in Table 6 and the degree 
20 of homology in Table 7 . 
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Table 6 

Sequences displaying homology to the L. pennellii Un5 cDNA 





Nl£> 


Species 


PID 


Blast score (gapped) 


1 


3608172 


Lescutentum 


U 3608I73 


5e-60 


2 


313128 | 


S.tuberosum 


313129 


3e-55 


3 


551258 


NAdbacum 


551259 


6c-47 


4 


170361 


Lesculentum 


170362 




5 


2177080 


L.esculentum 


546937 




6 


2175258 


L.esculenlum 


287474 




7 


TC4315 


Lcscuhnlum 






8 


pLutf 


Lpemieilii 






9 


el.in5 


L.esculentum 






10 


cLml 


Lescutanium 







5 

Table 7 

Pairwise percent identity between cDNAs of the various invertases presented 

in Table 7 





360817 
2 


31312ft 


551258 


170361 


2177080 


21775258 


TC4315 


pLiiiS 


«Lin5 


cl.i»7 


J608I72 




76.0% 


82.9% 


63,1% 


63.2% 


64.2% 


6i.9% 


75.6% 


75.0% 


75.8% 


313128 


76.0% 




76.4% 


62.4% 


fi2..1% 


63.1% 


63.6% 


74.9% 


73.9% 


74.5 % 


551258 


K2.9% 


76.4% 




61.6% 


61.2% 


62.1% 


62.7% 


73.8% 


73.8% 


76,5% 


170361 


63.4% 


62.4% 


61.6% 




97.7% 


98.6% 


68,1% 


65.6% 


64.4% 


62.1% 


21770*11 


63.2% 


62.3% 


6!. 2% 


97.7% 




91.5% 


68.0% 


653% 


64.4% 


61.8% 


21775258 


64.2% 


63.1% 


62.1% 


yft.6% 


91.5% 




68.9% 


66.5% 


65.2% 


62.8% 


TC4315 




63 6% 


627% 




68.0% 


68.9% 




67.7% 


67.2% 


61.9% 


pLinS 


75.6% 


74.9% 


73 8% 


6*6% 


65.*% 


66.5% 


67.7% 




97.7% 


79.6% 


cUn5 


75.0% 


73.9% 


73.8% 


64.4% 


64.4% 


65.2% 


67.2% 


97.7% 




79.7 % 


cUu7 


7S.K% 


74.5 % 


76.5 % 


62. 1 % 


6U% 


62.8 % 


61.9% 


79.6% 


79.7 % 





10 
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Pairwisc percent identity between proteins translated front the cDNA 
sequences of pLin S eLin 5 and eLinl and database sequences 





3608173 


313129 


351259 


1703« 


540937 


287474 


TC43I5 


pLin 5 


cLiaS 


tM»7 


360*173 




77.7% 


83.9% 


45.0% 


14.3% 


44.0% 


ND 


72.5% 


72.2% 


72.9 % 


313129 


77.7% 




79.8% 


43.4% 


43.1% 


39.8% 


ND 


74.3% 


73.1% 


72.7 % 




85.9% 


79.8% 




44.7% 


44.0% 


40.0% 


ND 


74.8% 


743% 


766% 


170**2 


45.0% 


43.4% 


44,7% 




99.1% 


98.0% 


NO 


44.0% 


44.2% 


43.1 % 


546937 


44.3% 


43.1% 


44.0% 


99.1% 




V7.l% 


ND 


43.5% 


43,7% 


414% 


2K7474 


44.0% 


39,8% 


400% 


08.0% 


97.1% 




ND 


40.0% 


40.0% 


38.7 % 


TC43I5 


ND 


ND 


ND 


NO 


ND 


ND 










pLi»5 


77.5% 


74.1% 


74.8% 


44 0% 


43.5% 


40.0% 


Nt) 




96.8% 


77.4 % 


cLin5 


72.2% 


73.1% 


74.3% 


44.2% 


43.7% 


40%% 


ND 


90.8% 




76.7 % 


tLiu7 


72.9 % 


72.7 % 


76.6% 


43.1% 


42.4 % 


38.7 % 


ND 


77.4% 


76.7% 





5 

The proposed translated protein of eLin5 (SEQ ID NO:6) shows a high 
degree of sequence identity to L. Pennellii and L esculentum, while identity to 
other invertase proteins (partial sequences) for both pLin5 and eLin7 was less 
than 75 %. The identity between pLinS and eLin7 was about 77 % (Table 8). 

10 The homology (identity + similarity) between the apoplastic invertases 

(partial sequences) to the eLinS translated protein sequence is 85-86 % (not 
shown), whereas for the vacoular invertases the homology is below the 
detection threshold of the blast search. 

Comparison of the L. pennellii and L. esculentum sequences revealed 

15 several differences that may be responsible for the effect of Brix9-2-5 (Figure 
14b), 

The L pennellii third intron was longer than its corresponding sequence 
in L esculentum (201 bp vs. 179 bp) and included two 18-bp perfect direct 
repeats, as compared to a difference of one nucleotide between the direct 
20 repeats of L. esculentum. L. pennellii carried a 7-bp triple repeat 5' to the first 
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direct repeat, while in L esculentum both the triple repeats were deleted. 
These repeats may regulate the expression of LinS or other genes, as was 
recently demonstrated for a 73 -bp enhancer with similar structures in rat (Hung 
and Penning, 1999, Mol. Endocrin. 13,1704-1717). 

5 Hie mapping of the brix QTL was facilitated by the nearly isogenic 

nature of the phenotyped segregating populations where all the genetic 
variation for the quantitative Irait was associated with the introgressed 
segment. The recombination hotspot created multiple isogenic chimeric alleles 
that delimited the QTL to a defined sequence. This hotspot, which may be 

io associated with the direct repeats in intron 3, created 13 recombinants within a 
948-bp interval as compared to only 16 recombinants for the rest of the 100- 
Kb BAC. This observation is consistent with studies in maize, where 
intragenic recombination frequencies were found to be several times greater 
than recombination between genes (Dooner and Martinez-Ferez, 1997, Plant 

15 Cell 9, 1633-1646(1997). 

Much of our understanding of development is based on analysis of 
mutants which display a loss of function. However, the variation of greatest 
interest is often quantitatively inherited and originates from natural 
populations. To determine the molecular basis of such traits, it is necessary to 

20 clone the genes and devise molecular and genetic complementation approaches 
sensitive enough to detect minor variations in gene expression pattern and 
function. This study highlights the potential of wild species alleles for 
unraveling novel variations which can be potentially useful to agricultural 
production. 

25 

Although the invention has been described in conjunction with specific 
embodiments thereof, it is evident that many alternatives, modifications and 
variations will be apparent to those skilled in the art. Accordingly, it is 
intended to embrace all such alternatives, modifications and variations that fall 
30 within the spirit and broad scope of the appended claims. All publications, 
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patents, patent applications and sequences identified by their accession 
numbers mentioned in this specification are herein incorporated in their 
entirety by reference into the specification, to the same extent as if each 
individual publication, patent, patent application or sequence identified by 
their accession number was specifically and individually indicated to be 
incorporated herein by reference. In addition, citation or identification of any 
reference in this application shall not be construed as an admission that such 
reference is available as prior ait to the present invention. 



